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Abstract — Today’s software industry is all about efficiency. 
As technology trends are changing rapidly, implementation of 
more complex system at a cheaper cost & also maintenance of 
better quality at the same time are crucial challenges for 
software companies. Researchers have proposed numerous 
mechanisms for software cost estimation. This paper provides 
general overview of existing software cost estimation models and 
techniques. It also highlights strengths and weakness of various 
popular methods and covers the latest trends in this field.. 

Index Terms — COCOMO , Estimation Process, Software 
Cost, SLOC, Software Metrics 

I. INTRODUCTION 

From the beginning of computer era, estimation of cost & 
effort involved in software development has been an 
important & challenging task. Software industry is getting 
more seasoned & complex these days because the size and 
importance of software applications have grown a lot. 
Without a doubt it is now the driving force of industry area, 
government & military operations, modern businesses, 
scientific, medical & technical fields. The vital link between 
the general concepts and techniques of economic analysis and 
the particular world of software engineering is provided by 
software engineering cost estimation. In order to make good 
management decisions and for accurately determining how 
much time, effort and resources are required, precise 
prediction of software development cost is must thus while 
software development one of the most crucial task is 
estimation. Many factors are responsible for accuracy of any 
project like size and level of complexities of project, business 
plans, resources required, resources used, impact of changes 
and re planning, customer expectations. According to studies 
most projects (60-80%) encounter effort and/or schedule 
overruns. Several models and techniques are available still 
accurate prediction is a challenge for analyst, software 
managers and researchers. 

A. SOFTWARE COST & ESTIMATION PROCESS 

The process of predicting how many resources and how 
many hours are needed to develop a software project is called 

SOFTWARE COST/ EFFORT ESTIMATION. 

SOFTWARE COST comprises 3 major elements namely- 

• Manpower 

. Effort 

• Duration 
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According to NASA’S guide on s/w cost estimation the basic 
equation for software cost is as follow (NASA, 2003): 
Software development labor costs + other labor costs + Non- 
labor costs = Total Software costs 

Software development labor costs - The labor, resources, 
time requires during different development phases like 
functional design, code development, requirement analysis, 
interface specification etc . . . 

• Other labour costs - It covers costs involved during s/w 
quality assurance, test bed development, assembly, test 
& launch operations, verification & validation 
operations etc. . . 

• Non-labor costs - Costs involved in training, travel & 
trips, software procurement, support & services etc. . . 

B. Reasons for failure of IT Projects 

• Lack of knowledge & experience in estimation 

• Lack of data, lack of time & resources to perform 
estimate accurately 

• Poor user input and vague requirements 

• Conflicts among stakeholders 

• Rapid changes in IT and methodologies 

• Poor architecture 

• Improper planning, improper selection of SDLC model 
during development phases 

Poor risk management 

C. Importance of Good Estimation 

Typically 4 major variables- time, requirements, resources 
(people, infrastructure/materials and money) and risks 
controls software projects. If any of these encounter 
unexpected changes, its impact will be on project. Both 
underestimation and overestimation of project needs can 
cause major problems. 

• It can help to categorize and prioritize development 
projects according to an overall business plan. 

• It is always helpful in assessing the impact of changes 
and helps in preplanning. 

• When resources are better matched to real needs, 
projects can be easily managed and controlled. 

• It helps in deciding what resources are needed to 
commit to the project and how well these resources will 
be used 

II. BACKGROUND 

Before 1970, Thumb rules or some algorithms which were 
based on Trial and error were used for effort estimation [9]. 
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1970 was a vital period to anticipate the expenses and 
schedules for software development. Computerized Software 
estimation tools were constructed. Some difficulties were 
experienced while building large software systems [16]. 
During mid 1970's the first automated s/w approximation tool 
had been flesh. The prototyping composite model is 
COCOMO (Constructive Cost Model) developed by Barry 
Boehm and is portrayed in book Software Engineering 
Economics [9]. In 1975, based on five different attributes 
namely-Inputs, Output, Inquires, Logical Files, Interfaces .A 
new Function Point Analysis approach was developed for 
estimation of size and development effort [2], In 1977, Frank 
Freiman designed PRICE-S Software estimation model. In 
1979, Lawrence H. Putnam introduced SLIM (Software Life 
Cycle Model) to US-Market [21]: This model was based on 
Norden Rayleigh Curve. In 1983, Ada Programming language 
was introduced by DOD (U.S. Department of Defense). 
Ada-COCOMO model was build which reduced developing 
cost of large systems [22]. 1981, Dr. Barry Boehm 
highlighted the essential algorithms of Constructive Cost 
Model (COCOMO) through his book “Software Engineering 
Economics”. During the same year Allan Albrecht published 
an article to the FPA method. This article sharpened the rules 
for rating the complexity of software [9], In 1982, Tom de 
Marco introduced a functional metric that inherited some of 
the features of Albrecht’s function point, but was developed 
independently. A book “controlling software projects” was 
released by him for introducing this metric. In 1983, Mark II 
function point metric was introduced by a British software 
estimating researcher Charles Symons [12], In 1984 a major 
revision of function point metric was done by IBM which is 
basis of today’s function points. 1985, In order to include the 
effect of computationally complex algorithms, concept of 
Function Point was extended by Caper Jones [3]. In 1986, 
IFPUG (International Function Point Users Group) was 
established in Toronto, Canada because of quickly 
developing utilization of Function Point Metrics. 1990, Barry 
Boehm, at college of Southern California started to revise and 
expand the idea of original COCOMO model. 1991 , Michel 
van Genuchten and Hans Koolen they added to various 
techniques and tools which were created over number of years 
to meet the expanding need to control programming 
advancement [10]. 1992, Betteridge, R. worked on software 
costing. A method called Mark II Function Point was used to 
predict cost of number of projects [8], 1993, COCOMO 2.0 
the new version of COCOMO was introduced which was 
emerged in 1994 [7], 1994, Rajiv D Banker and Hsihui Chang 
and Chris F Kemerer, they thought that it was helpful for 
expense estimation and profit assessment purposes' to 
consider software development as a economic production 
process[4], 1996, from the early system specifications Sophie 
Cockroft obtained accurate size estimations [14], In 1997, 
techniques were more focused on accuracy and existing 
models were reviewed. In 1998, a new model called MARCS 
was constructed by Chatzoglou, to give predictions of the 
resources (time, effort, cost, and people) [13], In 1999, J. J. 
Dolado, made a research using the technique of Genetic 
Programming (GP) for exploring possible cost functions [15]. 
In 2001, new approach was proposed which was based on 
reasoning by analogy and to estimate the effort linguistic 
quantifiers were used [1]. In 2002, M. Jorgensen, expert 
estimation was the most frequently applied estimation 
strategy for software projects [18]. In 2003, Yunsik Ahn, 


Jungseok Suh, Seungryeol Kim and Hyunsoo Kim, proposed 
SMPEEM (Software Maintenance Project Effort Estimation) 
[25], In 2004,Barbara proposed the idea of EBSE (Evidence 
based Software Engineering) [6] .In 2005, sequence was 
decided and needed to be carried out for software estimation 
Sizing Project deliverables, Estimating quality and defect 
Removal efficiency. Selecting Project activities. Estimating 
staffing levels, Estimating Effort, Estimating Costs, 
Estimating Schedules, Estimating requirements growth 
during development [11]. 2006, Stein Grimstad, effort 
estimate was frequently used without sufficient clarification 
of its meaning, and that estimation accuracy is often evaluated 
without ensuring that the estimated and actual effort were 
comparable[23].In 2007, for effort estimation different 
methods were introduced. The average accuracy of effort 
estimates based on expert judgment was higher than the 
average accuracy of models. In 2008, Parvinder S. Sandhu 
focused on predicting the accuracy of models. As a soft 
computing approach, neuro-fuzzy system was used to 
generate the model because Neuro-Fuzzy system was able to 
approximate the non-linear function with more precision [20]. 
2010, In order to reduce the error and to minimize the changes 
of estimates from actual different estimation techniques were 
combined [19, 24], 2011, numerous estimation techniques 
were proposed and used extensively by practitioners for use in 
Function Oriented Software development. 2012, A lot of 
commercial software costs estimating tools have been 
released till today.. 

III. Review of Existing Methods 
To improve the correctness of s/w development effort 
estimation several techniques and models have been proposed. 
These methods includes- Algorithmic estimation, analogy 
based estimation, data mining techniques, soft computing 
techniques, artificial neural network based techniques, expert 
judgment based techniques. This section enlists some of them 
along with their comparative advantages and disadvantages. 

A. Algorithmic Estimation 

It uses mathematical equations to perform software 
estimations. These mathematical formulae relates 
independent variables( like cost drivers) to dependent 
variables (like effort, cost). Source lines of codes 
(SLOC), number of functions and other cost drivers such as 
languages, methodologies, risk assessment etc... are taken 
into account in this kind of estimation method. Model based 
on Algorithmic estimation are summarized in below table- 
TABLEI 

Model based on Algorithmic estimation 


Model 

Effort Equation 

Description 

COCOMO 

Model 

E= a 

x(KLOC)*b x 
EAF 

Developed by 
BOHEM, constant 
value a, b, depends 
on project type 
weather it is 
organic 

semi-detached or 
embedded. 
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SLIM Model 

Technical 

constant, 

C = Size x 

Bl/3 x T4/3 

Total person 
months, B=l/T4 
x (Size/C)*3. 
T=Development 
time in years. 

C= Parameter 
dependent on 
development 
environment. 

It is empirical 
effort estimation 
model, developed 
by Lawrence H 
Putnam in 1978.lt 
provides 

description of Time 
and effort needed 
to complete a 
software project of 
specified type. 

Walston-Felix 

Model 

E = 5.2 (KLOC) 
0.91 

D = 4.1 (KLOC) 
0.36 

Developed by 
C.E.Walston and 
C.P. Felix in 1977. 

It is method of 
programming 
Measurement and 
estimation. 

Albrecht-Gaffne 
y MODEL 

It uses Function 
point to estimate 
efforts. 

Developed by IBM 
DP Services 
Organizations. 

Kemerer 

MODEL 


It is cost estimation 
model uses 

Function points and 
Linear Regression. 


B. Analogy based Estimation 
It compares new projects with similar projects from the past, 
make relationship and find similarity in order to find accurate 
result. 

C. Data mining Techniques 
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These techniques transforms large and complex data into 
meaningful patterns and rules. Regression and Classification 
are some basic operation of data mining. 

D. Rule Induction 

It is particular aspect of inductive learning in which rules are 
produced by algorithms as a result of modelling. These rules 
are transparent and therefore can be read and understood 
easily, it is the advantage of inductive learning over neural 
network based learning. 

E. Artificial Neural Network based Estimation 

To find accurate estimate for software building efforts, 
Machine learning and Pattern Recognition methodology are 
used. ANN can learn from previous data and is able to find 
relationship between dependent and independent variables. 

F. Function Point analysis based Model 

At first in 1983, to measure the functionality of project 
Albrecht presented Function point analysis, this method 
measure the size of software, it considers internal logical files, 
external interface files, external input-output, external 
inquiries from functional viewpoint metric. ESTIMACS and 
SPQR/20 are the models which adopt FPA approach of 
estimation. 

G. Soft Computing Techniques 

Basically it is a consortium of methodologies cantering in 
artificial neural networks, fuzzy logic, and evolutionary 
computation. Particle Swarm Optimization, Ant Colony 
Optimization, Genetic Programming, fuzzy Logic etc... 
comes under this section, these methodologies provide in one 
form or another flexible information processing capability for 
managing real fife ambiguous circumstances. These are 
complementary and synergistic, rather than competitive. 


TABLE II 

Comparative Advantages and Limitations Of Existing Methods 


METHODS 

TYPE 

KEY ADVANTAGES 

DISADVANTAGES 

COCOMO 

Algorithmic 

Very common approach provides 
clear results. 

This model is not suitable 
for many projects as large 
amount of data is required. 

Neural 

network 

based 

estimation 

methods 

Non-algorithmi 

c 

These methods provide power of 
reasoning and are consistent with 
unlike databases. 

Large training data is 
required. Lack of adequate 
amount of data set effects 
performance, no guidelines 
are available for designing. 

Function 
point analysis 

Algorithmic 

Results are better than SLOC, 

Language independent, Since 
function points are based on system 
users external view of system, 

Non- tech users have a better 
Understanding of what FP are 
measuring. 

Resulting metrics are straightforward and 
logical. 

Mechanization is hard to do 
as precise counting require 
In-depth knowledge of 
standards. 

Analogy 

Non-algorithmi 

Having special experts is not 

A lot of information about 
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based 

estimation 

c 

important, works based on 
actual experiences. 

past similar projects is 
required. In some situations 
similar projects are not 
available. 

Putnam 

model 

Algorithmic 

This model is basically based on 2 
variables which are time and size. 

This model does not 
consider all other aspects of 
software development life 
cycle. 

Fuzzy logic 
based 
estimation 

Non-algorithmi 

c 

This approach is capable 
to handle uncertainty and 

provides reliable estimates, no training is required, 
other advantage is its 
flexibility. 

This method is hard to use, 
estimation of complex 
features cost is much 
tedious. 


IV. Recently Developed Different Approaches and 
Findings 

It has been observed that all estimation methods are 
specific for some specific type of projects. Every method or 
model has its own significance and importance therefore it is 
very hard to make a decision which method is better than to all 
other methods. This section summarizes the recent research, 
researchers worked with another field along with the software 
engineering like data mining and machine learning techniques 
for improving the accuracy of software cost estimation 
process. 

TABLE mi 

Different Cost Estimation Methods 


Researchers 

Different Approaches For 
Software Cost Estimation And 
Findings 


Witting and 
Finnie 

In order to predict software 
development effort describe 
use of back propagation 
learning algorithms on a 
multilayer perception. 

(26), 

(27) 

Lefley and 
Shepperd 

They applied the concept of 
genetic programming to 
improve software cost 
estimation on public datasets 
with great success 

(28) 

Prasad Reddy et 
al. 

Used the concept of Multi 
Objective (MO) Particle 
Swarm Optimization and 
proposed a model for 
software cost estimation 

(29) 

Vinaykumar et al 

For the prediction of 
software cost estimation used 
wavelet neural networks 

(30) 

Oliveira 

This work is based on 
comparative study on 

support vector regression 
(SVR), radial basis functions 
neural networks (RBFNs) 
and linear regression for 
estimation of software 
project effort and result 
clears it that SVR 
significantly outperforms 

RBFNs and linear 

regression. 

(31) 


Pahariya et al 


Reddy et al 


Andreou et al 


Sweta and 
Pushkar 


Described a new 
computational intelligence 
sequential hybrid 

architecture which includes 
programming and Group 
Method of Data Handhng 
(GMDH). Data mining 
methods such as Radial Basis 
Function (RBF) Multi-Layer 
Regression (MLR), and so on 
are included in this work. 


By applying Gaussian 
membership function which 
provide better performance 
than the trapezoidal function 
to presenting cost drivers this 
work enhanced fuzzy 
approach for software effort 
oftheCOCOMO. 

This work considered Fuzzy 
Decision Trees (FDTs) for 
estimating required effort 
and software size in cost 
estimation as if strong 
evidence about those fuzzy 
transformations of cost 
drivers contributed to 
enhancing the prediction 
I process- * 

This work provides a 
comparative study on 
Intermediate COCOMO, 
support vector regression 
(SVR and Multiple 
Objective Particle Swarm 
Optimization (MOPSO) 
model for prediction of 
project effort and it has been 
observed through simulation 
it has been observed that in 
comparison of other 

estimating techniques SVR 
provides better result in 
terms of accuracy and error 
rate. 
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V. Current Trends in Software Cost Estimation 

A. Use ofSLOC/SDI 

The current trend is now trying to get away from SLOC/SDI 
and getting more focused on Function Points. The reason 
behind is that Function Points are more independent, less 
dependent on languages and programming environment as 
compared to SFOC/SDI. 

B. In House Metrics Development 

Nowadays, the majority of systems developers and 
consultants have a methodology to find out the a priori cost of 
a software development project, such cost estimation 
methodology is allied to a specific systems analysis and 
design methodology. This estimation of cost is based on the 
use of the analysis methodology, knowledge and experience 
of the firm. 

C. Prototyping 

Boehm and Papaccio's spiral development model is in essence 
a prototyping model in which a system is developed in phases, 
which includes requirements specifications, cost to 
completion, and the risk evaluated at each step. In recent years 
prototyping has become a major part of many systems 
developments efforts. 

D. Wide Commercial Industries of Estimation Tools 
Estimation of cost/effort while software development is a 
complex activity, there is a commercial industry of companies 
which are marketing software estimation tools. As of 2013, 
most widely used tools for estimation purpose are-COCOMO 
II, SEER, SLIM, Software Risk Master (SRM), and 
TruePrice. 


VI. Conclusion and Future Work 
A Process of estimation reflects the reality of project’s 
progress. It manages cost/budget & controls overruns. No 
single method is necessarily better or worse than the other, in 
actual, strengths and weaknesses of each are often 
complimentary to one other .This paper provides a review of 
different types of methods in software cost estimation. To 
produce meaningful and reliable estimates, knowledge of 
each technique and understanding of software attributes and 
their casual relationship is must. More research is considered 
necessary to sizing the software functional requirement 
directly once it stores in CASE tool, which will result to quick 
estimation and reduction in cost. In object-oriented CASE 
environments, object points is one of the capable and 
promising technique but more research in this field is 
required. 
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